OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Issues in Indian languages computing in particular reference to search and retrieval in Telugu language

Internal identifier: 000953 (Main/Exploration); previous: 000952; next: 000954

Authors: Devika P. Madalli [India]; Dimple Patel [India]

Source: Library Hi Tech, vol. 27, no. 3 (2009), pp. 450-459

RBID : ISTEX:E12736709606DB346F098D7972E67B9F57E72FA7

Abstract

Purpose: The purpose of this paper is to discuss the various issues involved in Indian languages computing, particularly for Telugu, such as creating, displaying, searching and retrieving digital content. The paper also aims to emphasize the issues involved in retrieval in Indian languages. The complexities presented by the grammar, syntax and morphology of Indian languages are discussed.

Design/methodology/approach: The paper undertakes and presents a descriptive study of the issues and challenges in Indian languages computing in general and in the Telugu language in particular.

Findings: The problem of multilingual information retrieval in Indian languages is multipronged. A major observation of this study is that, though digital content is available in Indian languages, it is mostly in non-standard encoding formats and fonts. There is an urgent need to develop search algorithms for Indian languages, such as Soundex and Metaphone, that tolerate the spelling variations and mistakes a user might make in queries and that suggest correct spellings.

Practical implications: With existing technologies, libraries can now build online catalogues in the language of the documents or build digital repositories with content in various Indian languages. Though some library automation software, such as NewGenLib, and digital library software, such as DSpace, offer Unicode support for Indian languages, they do not allow for different types of search such as truncation search or word-variant search. The present study is a step towards developing algorithms for indexing and searching in Indian languages.

Originality/value: The paper addresses various issues in Indian language computing, with emphasis on search and retrieval.
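The abstract names truncation search as a feature missing from existing Unicode-aware library software. The following is a minimal sketch of right-truncation (prefix) matching over Unicode text, not the authors' method: the function names, the `*` wildcard convention, and the sample index terms are assumptions made for illustration. Both the query and the indexed terms are normalized to NFC first, so that canonically equivalent code point sequences compare equal.

```python
import unicodedata


def normalize(text):
    """Return the NFC form so canonically equivalent strings compare equal."""
    return unicodedata.normalize("NFC", text)


def truncation_search(pattern, terms):
    """Match terms against a pattern; a trailing '*' means right truncation.

    The '*' wildcard convention is an assumption of this sketch.
    """
    if pattern.endswith("*"):
        stem = normalize(pattern[:-1])
        return [t for t in terms if normalize(t).startswith(stem)]
    target = normalize(pattern)
    return [t for t in terms if normalize(t) == target]


# Illustrative index terms (hypothetical data, not taken from the paper).
INDEX_TERMS = ["తెలుగు", "తెలుగులో", "భాష"]

if __name__ == "__main__":
    for term in truncation_search("తెలుగు*", INDEX_TERMS):
        print(term)
```

Prefix matching on code points is only a starting point; it does not address the spelling variation or morphological issues that the abstract also raises.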

URL: https://api.istex.fr/document/E12736709606DB346F098D7972E67B9F57E72FA7/fulltext/pdf
DOI: 10.1108/07378830910988568


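Affiliations:
Devika P. Madalli: Documentation Research and Training Centre, Indian Statistical Institute, Bangalore
Dimple Patel: Department of Library & Information Science, Osmania University, Hyderabad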


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Issues in Indian languages computing in particular reference to search and retrieval in Telugu language</title>
<author>
<name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
</author>
<author>
<name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E12736709606DB346F098D7972E67B9F57E72FA7</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1108/07378830910988568</idno>
<idno type="url">https://api.istex.fr/document/E12736709606DB346F098D7972E67B9F57E72FA7/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000500</idno>
<idno type="wicri:Area/Istex/Curation">000493</idno>
<idno type="wicri:Area/Istex/Checkpoint">000475</idno>
<idno type="wicri:doubleKey">0737-8831:2009:Madalli D:issues:in:indian</idno>
<idno type="wicri:Area/Main/Merge">000961</idno>
<idno type="wicri:Area/Main/Curation">000953</idno>
<idno type="wicri:Area/Main/Exploration">000953</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Issues in Indian languages computing in particular reference to search and retrieval in Telugu language</title>
<author>
<name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Inde</country>
<wicri:regionArea>Documentation Research and Training Centre, Indian Statistical Institute, Bangalore</wicri:regionArea>
<wicri:noRegion>Bangalore</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Inde</country>
<wicri:regionArea>Department of Library &amp; Information Science, Osmania University, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Library Hi Tech</title>
<idno type="ISSN">0737-8831</idno>
<imprint>
<publisher>Emerald Group Publishing Limited</publisher>
<date type="published" when="2009-09-04">2009-09-04</date>
<biblScope unit="volume">27</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="450">450</biblScope>
<biblScope unit="page" to="459">459</biblScope>
</imprint>
<idno type="ISSN">0737-8831</idno>
</series>
<idno type="istex">E12736709606DB346F098D7972E67B9F57E72FA7</idno>
<idno type="DOI">10.1108/07378830910988568</idno>
<idno type="filenameID">2380270310</idno>
<idno type="original-pdf">2380270310.pdf</idno>
<idno type="href">07378830910988568.pdf</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0737-8831</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">Purpose: The purpose of this paper is to discuss the various issues involved in Indian languages computing, particularly for Telugu, such as creating, displaying, searching and retrieving digital content. The paper also aims to emphasize the issues involved in retrieval in Indian languages. The complexities presented by the grammar, syntax and morphology of Indian languages are discussed. Design/methodology/approach: The paper undertakes and presents a descriptive study of the issues and challenges in Indian languages computing in general and in the Telugu language in particular. Findings: The problem of multilingual information retrieval in Indian languages is multipronged. A major observation of this study is that, though digital content is available in Indian languages, it is mostly in non-standard encoding formats and fonts. There is an urgent need to develop search algorithms for Indian languages, such as Soundex and Metaphone, that tolerate the spelling variations and mistakes a user might make in queries and that suggest correct spellings. Practical implications: With existing technologies, libraries can now build online catalogues in the language of the documents or build digital repositories with content in various Indian languages. Though some library automation software, such as NewGenLib, and digital library software, such as DSpace, offer Unicode support for Indian languages, they do not allow for different types of search such as truncation search or word-variant search. The present study is a step towards developing algorithms for indexing and searching in Indian languages. Originality/value: The paper addresses various issues in Indian language computing, with emphasis on search and retrieval.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Inde</li>
</country>
</list>
<tree>
<country name="Inde">
<noRegion>
<name sortKey="Madalli, Devika P" sort="Madalli, Devika P" uniqKey="Madalli D" first="Devika P." last="Madalli">Devika P. Madalli</name>
</noRegion>
<name sortKey="Patel, Dimple" sort="Patel, Dimple" uniqKey="Patel D" first="Dimple" last="Patel">Dimple Patel</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000953 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000953 | SxmlIndent | more
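
Where Dilib is not available, a record saved as text can also be read with a general-purpose XML parser. The sketch below uses Python's standard `xml.etree.ElementTree` and rests on several assumptions: the record has the shape shown above, any ampersands in it are escaped as `&amp;`, and the `wicri:` prefix, which the record does not declare, is bound to a placeholder namespace URI invented for this example. The helper names `parse_record` and `summarize` and the shortened `SAMPLE` record are likewise hypothetical.

```python
import xml.etree.ElementTree as ET

# Placeholder URI (assumption): the record never declares the wicri: prefix,
# so one is bound here only to keep the XML parser from rejecting it.
WICRI_NS = "http://example.org/wicri"


def parse_record(text):
    """Parse a <record> string after binding the undeclared wicri: prefix."""
    patched = text.replace("<record>", f'<record xmlns:wicri="{WICRI_NS}">', 1)
    return ET.fromstring(patched)


def summarize(record):
    """Collect the title, author names and DOI from a parsed record."""
    title_stmt = record.find(".//titleStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [name.text for name in title_stmt.findall("author/name")],
        "doi": record.findtext(".//publicationStmt/idno[@type='doi']"),
    }


# Shortened copy of the record, kept only to make the sketch self-contained.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader><fileDesc>
<titleStmt>
<title xml:lang="en">Issues in Indian languages computing in particular reference to search and retrieval in Telugu language</title>
<author><name first="Devika P." last="Madalli">Devika P. Madalli</name></author>
<author><name first="Dimple" last="Patel">Dimple Patel</name></author>
</titleStmt>
<publicationStmt>
<idno type="doi">10.1108/07378830910988568</idno>
</publicationStmt>
</fileDesc></teiHeader>
</TEI>
</record>"""

if __name__ == "__main__":
    for key, value in summarize(parse_record(SAMPLE)).items():
        print(key, value, sep=": ")
```

Binding the prefix by string replacement is a convenience for this sketch; a record with a different root element or an unescaped `&` would need to be repaired before parsing.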

To add a link to this page in the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:E12736709606DB346F098D7972E67B9F57E72FA7
   |texte=   Issues in Indian languages computing in particular reference to search and retrieval in Telugu language
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024